
What is claimed is; 

1. V method for profiling computer program executions 



n a computer processing system having a processor and a 
memory hierarchy j^'^mprising the steps of: 
5 executing a computersprogram; and 

storing, in a memory ar^y^^prof ile counts for events 
associated with the execution of tfte^^computer program, the 
memory array being separate and distincfe^f^m the memory 
hierarchy so as to not perturb normal operations of the 
10 memory hierarchy. 

2. The method ac'esfrding to claim 1, further 



comprising the step of updating the profile counts 




x v ^ 3 . ^Phe method according to claim 2 , wherein said 
i storing and updating steps are performed asynchronously to 
15 prevent a decreas^ofs^an execution speed of the computer 

program. 

4. The method according to claim 2, wherein said 
updating step is triggered by executionSof the events, 
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he method according to claim 2, wherein said 
updating step is triggered by execution of instructions 
embedded into\an instruction stream of the computer program. 

6. The method according to claim 2, further 
comprising the step\pf detecting whether a profile count has 
exceeded an adjustable predefined threshold. 



7. The method according to claim 2, further 
comprising the step of indicating when a profile count has 
exceeded an adjustable predefined threshold. 

8. The method according \to claim 7, wherein said 
indicating step comprises the step of raising an exception, 



9 . The method according to claim 2 , further 
comprising the steps of: 

accumulating profile updates; and\ 

dividing the accumulated profile updates by a threshold 
fraction. 



10. The method according to claim 2, further 
comprising the step of scaling the profile counts to prevent 
profile information overflow. 
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11. 

'""X 



he method according to claim 2, further 
comprising the step of identifying profile information 
corresponding to the profile counts using a profiling event 
identifier . 



12. The method according to claim 11, further 
comprising the step^of addressing the memory array, using 
the profiling event identifier. 



13. The method acoprding to claim 2, further 
comprising the steps of ; 

10 generating the prof ileXcounts using profile counters 

associated with the events; and 

maintaining the profile counters in a set-associate 
manner. 

14. The method according to cl^im 13, further 
15 comprising the step of selecting a profile counter to be 

evicted from the memory array based upon V predefined 
replacement, when a number of profiling events assigned to 
an associative class of events is exceeded, 
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15. ^ The method according to claim 14, wherein the 
replacemeA^ strategy is based upon one of 
least-recently-used and first-in-first-out. 



16. The Vnethod according to claim 2, further 
comprising the step of supporting read operations from the 
profile matrix i A an off-line optimization of the program. 

17. The method according to claim 2, further 
comprising the step of assisting at least one of compilation 
and optimization of the program, based upon the profile 
counts stored in the profile matrix. 

18. The method according to claim 17, wherein said 
assisting step is performed during at least one of dynamic 
binary translation and dynamic\ optimization of the computer 
program . 



19. The method according to claim 18, wherein the 
dynamic binary translation and dynama^c optimization of the 
computer program results in translated\and optimized code, 
respectively, the translated and optimized code comprising 
instructions groups which pass control therebetween. 
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) q\^s> 20. The method according to claim 19, further 

'comprising the step of identifying frequently executed paths 
of the computer program, by instrumenting exits from the 
instruction groups with a profiling instruction that 
indicates a u^que group exit identifier. 

\ 

\ 

21. The method according to claim 19, further 
comprising the st\p of extending the instruction groups 
along a frequently executed path. 



22. The method according to claim 1, wherein the 
memory hierarchy include^ data and instruction caches, and 
the memory array is separate and distinct from the memory 
hierarchy so as to not percurb normal operations of the data 
and instruction caches. 



23. An apparatus for profiling computer program 
executions in a computer processing system having a 



r , t\e 



processor and a memory hierarchy, tlie apparatus comprising: 

a memory array adapted to store profile counts for 
events associated with execution of the\computer program, 
said memory array being separate and distinct from the 
memory hierarchy so as to not perturb normal^ operations of 
the memory hierarchy; and 
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a! controller adapted to select the events and to update 



X 



the profile counts stored in said memory array. 



24. The apparatus according to claim 23, wherein said 
memory arrayv and said controller are adapted to 
asynchronously* store and update the profile counts, 
respectively, to prevent a decrease of an execution speed of 
the computer program. 

25. The apparatus according to claim 23, wherein said 
controller is adapted op update the profile counts as the 
events are executed. 

26. The apparatus according to claim 23, wherein said 
controller is adapted to updat^ the profile counts based 
upon instructions embedded into ^n instruction stream of the 
computer program. 



27. The apparatus according to claim 23, further 
comprising a comparator circuit adapted Vo detect whether a 
profile count has exceeded an adjustable predefined 
threshold. 
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28. Yhe apparatus according to claim 23, further 
\ 

comprising ap. indicating circuit for indicating when a 
profile count\has exceeded an adjustable predefined 
threshold. 



29. The apparatus according to claim 28, wherein said 
indicating circuit isy adapted to raise an exception when the 
profile count has exceeded the adjustable predefined 
threshold. 



10 



30. The apparatus according to claim 23, further 
comprising: 

an accumulation circuit adapted to accumulate the 



15 



updated profile counts; and 

a dividing circuit adapted to divide an accumulated 
value of the updated accumulated profile counts by a 
threshold fraction. 



31. The apparatus according to claim\23, further 
comprising a scaling circuit adapted to scale the profile 
counts to prevent profile information overflow 
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32. Tttie apparatus according to claim 3, wherein 



profile infoVmation corresponding to the profile counts is 
identified using a profiling event identifier. 



33. The apparatus according to claim 32, wherein the 
memory array is addressed using the profiling event 
identifier. 

34. The apparatus\according to claim 23, further 
comprising profile counters for generating the profile 
counts, said profile counters being associated with an event 
in a set-associate manner. 

35. The apparatus according to claim 14, further 
comprising a replacement circuit\adapted to select a profile 
counter to be evicted from the memory array based on a 
predefined replacement strategy, when a number of profiling 
events assigned to an associative c]\^s£^is exceeded. 

36. The apparatus according to claim 15, wherein the 
predefined replacement strategy is based upon one of 
least-recently-used and first-in-first-out 
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\ 37. ^he apparatus according to claim 23, wherein the 
s memory hierarchy includes data and instruction caches, and 
said memory arr^y^ is separate and distinct from the memory 
hierarchy so as to > ^not perturb normal operations of the data 
and instruction caches. 



38. The method according to claim 1, wherein said 
method is implemented by a program storage device readable 



-9 \pr< 



by machine, tangibly embodying k program of instructions 
executable by the machine to perform said method steps. 
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